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Identification of the NF-kB activating protein-like 
locus as a risk locus for rheumatoid arthritis 

Gang Xie/'^ Yue Lu,^ Ye Sun/'^ Steven Shiyang Zhang,^ Edward Clark Keystone,^ 
Peter K Gregersen,^ Robert M Plenge,^ Christopher I Amos,^ 
Katherine A Siminovitch^'^'^ 



ABSTRACT 

Objective To fine-map the NF-kB activating protein-like 
(NKAPL) locus identified in a prior genome-wide study as 
a possible rheumatoid arthritis (RA) risk locus and 
thereby delineate additional variants with stronger and/or 
independent disease association. 
Methods Genotypes for 101 SNPs across the NKAPL 
locus on chromosome 6p22.1 were obtained on 1368 
Canadian RA cases and 1471 controls. Single marker 
associations were examined using logistic regression and 
the most strongly associated NKAPL locus SNPs then 
typed in another Canadian and a US-based RA case/ 
control cohort. 

Results Fine-mapping analyses identified six NKAPL 
locus variants in a single haplotype block showing 
association with p<5.6x10~^ in the combined Canadian 
cohort. Among these SNPs, rs35656932 in the zinc 
finger 193 gene and rsl 3208096 in the NKAPL gene 
remained significant after conditional logistic regression, 
contributed independently to risk for disease, and were 
replicated in the US cohort (Pcomb=4.24x10"^° and 
2.44x10"^, respectively). These associations remained 
significant after conditioning on SNPs tagging the HLA- 
shared epitope (SE) DFIB1*0401 allele and were 
significantly stronger in the HLA-SE negative versus 
positive subgroup, with a significant negative interaction 
apparent between l-ltA-DRBI SE and NKAPL risk alleles. 
Conclusions By illuminating additional NKAPL variants 
with highly significant effects on risk that are distinct 
from, but interactive with those arising from the l-IU\- 
DHB1 locus, our data conclusively identify NKAPL as an 
RA susceptibility locus. 



INTRODUCTION 

Rheumatoid arthritis (RA) is a chronic auto- 
immune disease primarily associated with inflam- 
mation of the synovial joints and affecting up to 
1% of the population worldwide. Although the 
complex interplay of genetic and environmental 
factors underpinning RA are not well understood, 
major inroads have been made in mapping gene 
loci associated with risk for this disease."^ In add- 
ition to the HLA-DRB locus, over 35 non-major 
histocompatibility complex (MHC) RA risk loci 
have emerged from genome-wide association 
studies (GV\AS) and subsequent GV\AS 
meta-analyses of the GV\AS datasets.^""^^ 

Through a genome- wide scan of 2418 RA patients 
and 4504 healthy controls ascertained in Canada and 
the USA, we previously identified an association of 



RA with the REL NF-kB transcription factor locus 
and also confirmed already identified disease associa- 
tions with the PTPNZZ, CTLA4, TNFAIP3, ELK and 
TRAF1/C5 genes.^ Our data also showed strongly 
suggestive signals (Pqwas values between 8.2x10"^ 
and 5.28x10"^) emanating from a cluster of Single 
nucleotide polymorphisms (SNP) across a 70 kb 
region on chromosome 6p22.1 encompassing the 
NF-kB activating protein-like gene (NKAPL) as well 
as three Zinc finger protein transcription factors 
ZNF19S, ZNF307 and ZNF187. A follow-up GWAS 
meta-analysis of 5505 RA and 22 603 controls of 
European descent also revealed 13 genotyped or 
imputed SNPs across a 150 kb region encompassing 
the NKAPL locus to be strongly associated (Pmeta 
between 1x10"^^ and 1x10"^^) with RA (R Plenge, 
personal communication). Although NKAPL func- 
tions are unknown, its protein product shows 90% 
sequence similarity to NF-kB activating protein 
(NKAP), a protein implicated in A(F-/cS-mediated 
transcriptional activation of TNF and IL-l."^"^ As data 
from our group and others have also implicated 
other genes from the NF-kB signalling pathway (eg, 
REL, CD40, TRAFi TNFAIP3, PRKCQ and 
TNFRSFU) in RA susceptibility^ the NKAPL 
gene represents a compelling potential candidate 
gene for RA. We therefore undertook fine-mapping 
studies of the NKAPL locus aimed at confirming this 
association, identifying those variants providing the 
strongest association signal and defining whether 
such variants act together or independently of one 
another and/or the FLLA-DRB"! locus in conferring 
risk for RA. 

METHODS 
Subjects 

Study cohorts (see online supplementary methods) 
include: 3979 subjects of European origin (2078 RA 
patients and 1901 healthy controls) recruited inde- 
pendently from two clinical centres in Canada, 
Toronto (1368 cases and 1471 healthy controls) 
and Halifax (710 cases and 430 healthy controls) 
and a third cohort including 2064 subjects of 
European ancestry ascertained in the USA as part 
of the Brigham Rheumatoid Arthritis Sequential 
Study and used here for replication analysis. 

SNP selection 

SNPs from a 372 kb interval across the NKAPL 
locus on chromosome 6p22.1 were selected primar- 
ily based on at least one of the following criteria: 
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(1) HapMap phase III data identifying the SNP as a tag SNP 
with minor allele frequency >0.01 and r^ threshold of 0.8 or 

(2) localisation within 150 kb upstream or downstream of 
SNPs most significantly associated with RA in our GWA.S. 
Other SNPs studied were: the autoimmune disease-associated 
PTPNZZ rs2476601^'^ and two SNPs (rs660895 and rs6910071) 
that tag the HLA-DRBl*0401 allele on chromosome 6p21.3.^^ 

Statistical analyses 

Hardy-Weinberg equilibrium^ allelic association and conditional 
logistic regression analyses were performed using PUNK soft- 
ware VI. 07 (http://pngu.mgh.harvard.edu/purcell/plink/). For 
the allelic association tests, the threshold for declaring signifi- 
cance was assigned according to Benjamini and Hochberg^s False 
Discovery Rate method and set at p<5.00xl0"'^ (0.05/101). 
Cochran-Mantel Haenszel ^ analysis was used to combine p 
values and calculate OR from the Canadian and US cohorts and 
an R-script (http://www.rproject.org/) was used to generate 
figures. Haplotype block structure, depicted using Haploview 
software V4.1 (http://www.broad.mit.edu/mpg/haploview), 
was defined according to the criteria established by Gabriel"^^ 
and the pairwise estimates of standardised Lewontin^s disequi- 
librium coefficient (D'), whereas the linkage disequilibrium (LD) 
among pairs of SNPs was characterised according to the square 
of the correlation coefficient (r^). Conditional logistic regression 
analyses of multiple markers were performed using SAS V9.13 
(SAS Institute Inc., Cary North Carolina, USA). Gene-gene 
interaction analysis was performed by case-only interaction ana- 
lysis in which a logistic regression model was used to test for an 
association of shared epitope (SE) positivity with NKAPL risk 
alleles (coded in an additive fashion as -1, 0 or 1 for no, 1 or 2 
risk alleles, respectively). For multinomial logistic regression 



modelling, controls were considered as the lowest risk 
outcome, SE negative cases as the intermediate risk outcome 
and SE positive cases as the highest risk outcome, and these mul- 
tiple outcomes were then assessed according to number of 
NKAPL risk alleles. The statistical power for this study was 
evaluated using CaTS software (http://www.sph.umich.edu/ 
csg/abecasis/CaTS/) with the following parameters: disease 
prevalence 0.01, disease allele frequency 0.2, a=0.0005 (0.05/ 
101). Power to detect associations with relative risk of 1.5 was 
estimated to be 99.4%. 



RESULTS 

Fine-mapping of the RA-associated NKAPL locus at 6p22.1 

To identify risk allele(s) at the NKAPL locus, we genotyped 1368 
RA cases and 1471 controls from Toronto for 105 SNPs across a 
372 kb genomic region encompassing the NKAPL gene. 
Characteristics of the study design and subjects are outlined 
in online supplementary figure SI. Among the 101 SNPs that 
passed quality control, 16 achieved the set significance threshold 
of p<5.00xl0~^ with the top six markers showing associations 
with disease (p<6. 00x10"^) that remained highly significant 
(p values 1.80xl0"^-8.60xl0"^) after False Discovery Rate cor- 
rection (table 1 and online supplementary table SI). Haploview 
analysis of pairwise LD among the 101 SNPs revealed that these 
six most strongly associated SNPs map within a 70 kb segment 
representing the middle of three haplotype blocks across this 
region and containing the NKAPL gene and three zinc finger tran- 
scription factor genes, ZNF19S, ZNF307 and ZNF187 (figure 1 
and see online supplementary figure S2). The strongest associ- 
ation signal (p=2.48xl0~^) came from a ZNF187 intronic SNP 
(rs67998226) at the distal end of this haplotype block, but these 
variants were all in strong LD with one another, the pairwise LD 



Table 1 List of the six NKAPL locus SNPs showing the most significant association with rheumatoid arthritis in the Toronto- and Halifax-based 
case/control cohorts. 











RAF 












SNP 


Gene 


Risk allele 


Sample cohort 


Case 


Control 


OR (95% CI) 


p Value* 


Pfdr value 


Cochran Q p valuet 


rs13195291 


ZNF193 


A 


Toronto 


0.115 


0.075 


1.61 (1.33 to 1.94) 


5.03x10-^ 


8.60x10-^ 










Halifax 


0.116 


0.086 


1.40 (1.04 to 1.89) 


0.03 


0.04 










Combinedt 


0.116 


0.077 


1.56 (1.33 to 1.82) 


2.79x10-^ 


4.20x10-^ 


0.45 


rs35656932 


ZNF193 


T 


Toronto 


0.118 


0.075 


1.66 (1.38 to 1.99) 


3.82x10-^ 


1.80x10-^ 










Halifax 


0.119 


0.083 


1.48 (1.11 to 1.99) 


8.31x10-^ 


0.02 










Combined 


0.118 


0.075 


1.62 (1.39 to 1.89) 


7.34x10-^° 


2.20x10-^ 


0.52 


rsl 320401 2 


ZNF193 


A 


Toronto 


0.116 


0.075 


1.63 (1.35 to 1.95) 


1.53x10-^ 


3.90x10-^ 










Halifax 


0.11 


0.088 


1.29 (0.96 to 1.74) 


0.09 


0.10 










Combined 


0.114 


0.078 


1.53 (1.31 to 1.79) 


5.66x10-^ 


5.60x10-^ 


0.20 


rsl 7720293 


ZNF307 


T 


Toronto 


0.135 


0.089 


1.60 (1.35 to 1.90) 


5.27x10-^ 


1.80x10-^ 










Halifax 


0.129 


0.107 


1.23 (0.94 to 1.60) 


0.139 


0.14 










Combined 


0.133 


0.093 


1.49 (1.29 to 1.72) 


4.29x10-^ 


5.10x10-^ 


0.10 


rsl 3208096 


NKAPL 


G 


Toronto 


0.115 


0.075 


1.61(1.34 to 1.94) 


3.69x10-^ 


7.50x10-^ 










Halifax 


0.116 


0.081 


1.48 (1.10 to 1.99) 


8.75x10-^ 


0.02 










Combined 


0.115 


0.076 


1.58 (1.35 to 1.85) 


6.99x10-^ 


1.40x10-^ 


0.64 


rs67998226 


ZNF187 


C 


Toronto 


0.122 


0.077 


1.66 (1.39 to 1.99) 


2.48x10-^ 


1.80x10-^ 










Halifax 


0.122 


0.086 


1.47 (1.10 to 1.97) 


9.88x10-^ 


0.02 










Combined 


0.122 


0.079 


1.62 (1.39 to 1.88) 


6.53x10-^° 


2.20x10-^ 


0.48 



Genotype data are shown for six SNPs tested in 1368 cases and 1471 controls from Toronto in step 1 and replication in 710 cases and 430 controls from Halifax in step 2. Data 
for all other SNPs tested in Toronto cohort are shown in online supplementary table 81. 

^Nominal p value from an allele-based case-control comparison with 1 degree of freedom; p<0.05 are highlighted in bold type. Pfdr corresponds to p values adjusted for 
multiple testing using the False Discovery Method implemented in PLINK. 

tGenotyping data from the Toronto and Halifax cohorts were merged in PLINK and combined Pcan values for association evaluated using Cochran-Mantel Haenszel tests and the 
Cochran Q test for heterogeneity. 

FDR, False Discovery Rate; NKAPL, NF-kB activating protein-like; RAF, risk allele frequency. 
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Figure 1 Case-control association results of SNPs (single nucleotide polymorphisms) at the NF-kB activating protein-like (NKAPL) locus on 6p22.1. 
SNPs genotyped in the fine mapping (♦) analysis of the Toronto-based case/control cohort. Results of combined analyses for the rs35656932 and 
rsl 3208096 are shown (■). Positions of the genes across the NKAPL locus are shown in the middle panel. The lower panel shows the linkage 
disequilibrium relationships between 12 of the most strongly associated SNPs (p<5.0x10~^) across an ^-70 kb region encompassing the ZNF193, 
ZNF307, NKAPL and ZNF187 genes. The numbers shown in the squares in the lower panel are r^ values representing the correlation coefficient for a 
given marker pair. The shade intensity lightens in parallel with diminishing correlation coefficient (r^) values (r^=1: black, r^=0: white). 



(r^) between rs67998226 and either the three SNPs across the 
more proximal ZiVf^^3 gene (rsl3195291, rs35656932 and 
rsl3204012) or the NKAFl promoter region SNP (rsl3208096) 
being >0.94 and 0.83, respectively, and between rs35656932 and 
rsl3208096 being 0.87. 

To further examine effects of this locus on RA susceptibility; 
the six most significant SNPs were also typed in a 
Halifax-derived cohort including 710 RA patients and 430 con- 
trols. Four of the six associations (rsl3 195291, rs35656932, 
rsl3208096 and rs67998226) were replicated in this cohort, 
with combined (Toronto and Halifax) association signals 
(Pcan) ranging from 5.60x10"^ to 2.22x10"^ (table 1). 

The extent to which the signals observed in the combined 
analysis were independent of one another was next examined 



using stepwise logistic regression analysis wherein variables 
were iteratively added into an empty model. This analysis iden- 
tified the ZNF193/NKAPL locus rs35656932 SNP, which is in 
strong LD (r^=0.96) with the ZNF187 rs67998226 SNP as the 
variant most strongly associated with risk for RA, but also sug- 
gested that both NKAPL rsl3208096 and rs3656932 SNP alleles 
influence risk for RA (table 2). Additional conditional analyses 
of the six SNPs revealed that both rs35656932 and rsl3208096 
remain significantly associated with disease after conditioning 
on each marker (see online supplementary table S2). Because 
these data suggest independent contributions of alleles of 
rs35656932 and rsl3208096 on risk, associations of these two 
variants with disease were next explored in a third (US) cohort 
including 863 cases and 1201 controls. The two SNPs both 
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Table 2 Stepwise logistic regression in the two SNP models 



SNP added to the model 



p Value 



OR (95% CI) 



rs35656932 
rsl 3208096 



0.003 
0.030 



5.82 (1.82 to 18.58) 
3.57 (1.12to 11.11) 



Stepwise logistic regression results based on analysis of the six most strongly 
associated NKAPL locus SNPs in the 2078 RA cases and 1901 controls from 
Canada. 



replicated in this cohort (table 3): combined analyses of all 2941 
cases and 3102 controls typed for these SNPs (table 3) yielding 
highly significant Pcomb values (4.24x10"^^ for rs35656932 and 
2.44x10-^ for rsl3208096). 

Variants of the NKAPL locus and the HLA region jointly 
contribute to risk for RA 

Because the NKAPL locus maps to chromosome 6p22.1^ evalu- 
ation of its disease association may be confounded by effects of 
the HLA-DRB"! locus at 6p21.3^ a locus which encodes the 
RA-predisposing SE^ confers much of the genetic risk for RA^ 
and maps in a region of extensive LD. ~ Although the 
NKAPL gene lies 4306 kb upstream of the RA risk-related 
HLA-DRBl gene, the extent to which its association with RA 
may reflect LD with HLA-DRB^l risk alleles was investigated by 
genotyping the Canadian cases (2078) and controls (1901) for 
two SNPs (rs660895 and rs69 10071) that tag one of the most 
common SE encoding alleles^ HLA-DRBl*0401 , reassessing the 
A/7<API-disease association by logistic regression analysis with 
conditioning on the HLA-DRB1*0401 SNPs. This analysis con- 
firmed the strong association of RA with both tag SNPs^ 
the signals reaching p=2.04xl0"^^ and p=2.39xl0"^^ for 
rs660895 and rs691007l7 respectively (see online supplementary 
table S3). Importantly the association signals from each of the 
six NKAPL locus SNPs remained highly significant (p<lxlO~^^) 
after conditioning on the DRBl tag SNPs (see online supplemen- 
tary table S4). Analyses of the pairwise LD between the two tag 
SNPs and each of the 55 SNPs with nominal evidence (p<0.05) 
for disease association in the initial analysis of Toronto controls 
also revealed no sig nificant LD (r^<0.01 for each SNP pair) 
between any of these SNPs and the HLA-DRBl tag SNPs (data 
not shown). These findings imply that the association signal at 
the NKAPL locus represents an effect on risk that is not attribut- 
able to LD with the HLA-DRBr 0401 allele. 

To further evaluate the relationship between NKAPL and HLA 
locus effects on risk, we additionally used data from a large 
panel of British subjects genotyped for HLA alleles to impute, in 
the subset of the Canadian cohort included in our previous 
GWAS,^ 137 HLA alleles encoding classic HLA-A, B, C, DRBl, 



DQA and DQB molecules. Association tests were performed in a 
dataset combining the imputed alleles, and the GWAS-derived 
HLA region as well as the fine mapping-derived SNP genotypes. 
Single SNP association tests performed using this combined 
dataset and assuming an additive model (implemented in PLINK 
software) identified 379 SNPs (data not shown) or HLA geno- 
types with p values less than 1.0 xlO""^ (see online supplemen- 
tary table S5). These variants together with the six candidate 
SNPs identified by fine mapping were subjected to stepwise 
logistic regression analysis using SAS and a p value set at <0.01 
as the criterion to enter and remain in the model. From this ana- 
lysis, seven SNPs (including rs35656932), but no classical HLA 
alleles were retained in the model, suggesting that risk for 
disease at this locus is better explained by effects of SNPs 
rather than HLA alleles (see online supplementary table 
S6A). Stepwise logistic regression was then repeated with 
HLA-DRBl* 0401 forced into the model. After this analysis, five 
variants, including HLA-DRB1*0401 (p=0.04), were retained in 
the model, although this HLA allele did not reach our signifi- 
cance criterion of p<0.01 (see online supplementary table S6B). 
These data provide further evidence that NKAPL variant (s) per 
se contribute to risk for RA. 

Relevance of HLA genotypes to the NKAPL locus effect on 
risk for RA was also explored by assessing the extent to which 
the NKAPL region rs35656932 and rsl3208096 SNPs associate 
with disease in cases stratified based on presence or absence of 
the SE alleles or of anticyclic citrullinated peptide (anti-CCP) 
antibody, an autoantibody strongly associated with SE alleles. 
This analysis revealed the disease associations for both risk 
alleles to be much higher in the SE negative (p= 1.20x10"^^ for 
rs35656932 and 3.50x10"^^ for rsl3208096) than in the SE 
positive (p=4. 70x10"^ and 8.10x10"^, respectively) subgroup 
(table 4). By contrast, the association signals from these loci 
were essentially the same in the anti-CCP positive and negative 
subsets. Because a disparity between SE and anti-CCP status 
effects on this association was not expected, the stratified sub- 
groups were also genotyped for the PTPNZZ gene rs2476601 
SNP for which an association with RA susceptibility is well 
established and thought to correlate with SE and anti-CCP 
positivity.^^"^^ This analysis confirmed the strong association 
of RA with the rs2476601 variant (p=2.72xl0"^^; see online 
supplementary table S3) as well as the positive effects of the 
SE (p=1.40xl0"^^ SE positive vs 2.40x10"^ in SE negative 
cases)) and of anti-CCP antibody (p=1.50xl0"^^ in anti-CCP 
positive vs 5.10x10"^ in anti-CCP negative cases)) on this asso- 
ciation. Thus, the PTPN22 risk allele is associated with SE posi- 
tivity in an RA population in which NKAPL effects on risk are 
primarily observed in SE negative disease. 



Table 3 Associations of the NKAPL locus rsl 3208096 and rsl 3708096 SNPs with rheumatoid arthritis in the combined Canadian and US cohort 









RAF 










SNP 


Gene Risk allele 


Sample cohort 


Case 


Control 


OR (95% CI) 


Pcomb 


Meta Q p value 


rs35656932 


ZNF193 T 


Toronto 


0.118 


0.075 


1.66 (1.38 to 1.99) 


3.82x10-^ 








Halifax 


0.119 


0.083 


1.48 (1.11 to 1.99) 


8.31x10-^ 








USA 


0.088 


0.070 


1.29 (1.02 to 1.62) 


0.032 








Combined* 


0.109 


0.074 


1.50 (1.32 to 1.71) 


4.24x10-^° 


0.23 


rsl 3208096 


NKAPL G 


Toronto 


0.115 


0.075 


1.61 (1.34 to 1.94) 


3.69x10-^ 








Halifax 


0.116 


0.081 


1.48 (1.10 to 1.99) 


8.75x10-^ 








USA 


0.086 


0.068 


1.30 (1.03 to 1.63) 


0.028 








Combined 


0.107 


0.073 


1.48 (1.32 to 1.69) 


2.44x10-^ 


0.35 



^Combined p values for association (Pcomb) were performed using Cochran-Mantel Haenszel tests stratifying by centre. The Cochran Q tests for heterogeneity in the effect 
among studies. 

NKAPL, NF-kB activating protein-like; RAF: risk allele frequency. 
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Table 4 Analysis of NKAPL and PTPN22 risk allele disease associations in rheumatoid arthritis patient subgroups 













RAF 








CMD 


Type 


Cases 


Controls 


Risk allele 


Case 


Control 


p Value 


UK (95 /0 LI) 


rs35656932 


SE- 


510 


1901 


T 


0.150 


0.077 


1.20x10-^^ 


2.1 (1.7 to 2.6) 




SE+ 


1376 


1901 




0.105 


0.077 


4.70x10-^ 


1.4 (1.2 to 1.7) 




Anti-CCP- 


402 


1901 




0.128 


0.077 


1.70x10-^ 


1.8(1.4 to 2.3) 




Anti-CCP + 


913 


1901 




0.112 


0.077 


3.00x10-^ 


1.6 (1.3 to 1.9) 


rsl 3208096 


SE- 


510 


1901 


G 


0.145 


0.076 


3.50x10-" 


2.1 (1.7 to 2.6) 




SE+ 


1376 


1901 




0.105 


0.076 


8.10x10-^ 


1.4 (1.2 to 1.7) 




Anti-CCP- 


402 


1901 




0.128 


0.076 


3.70x10-^ 


1.8 (1.4 to 2.3) 




Anti-CCP + 


913 


1901 




0.112 


0.076 


1.80x10-^ 


1.5 (1.3 to 1.9) 


rs2476601 


SE- 


510 


1901 


A 


0.126 


0.093 


2.40x10-^ 


1.4 (1.1 to 1.7) 


(PTPN22) 


SE+ 


1376 


1901 




0.162 


0.093 


1.40x10-^^ 


1.9(1.6 to 2.2) 




Anti-CCP- 


402 


1901 




0.116 


0.093 


5.10x10-^ 


1.3 (1.0 to 1.6) 




Anti-CCP + 


913 


1901 




0.155 


0.093 


1.50x10-" 


1.8(1.5 to 2.1) 



Data for subphenotype analyses were available for 1886 Canadian cases. 
^Versus healthy controls. 

CCR cyclic citrullinated peptide; NKAPL, NF-kB activating protein-like; RAF: risk allele frequency; SE, presence of shared epitope encoding DRB1*0Wl *0102,*040l*0404, 
m05,m08, m09, ""WOI or *7402 alleles. 



Effects of the NKAPL locus on risk were the strongest in the 
SE negative patients and increased when conditioning on 
HLA-DRBl *0401 tag SNPs, suggesting interaction between 
NKAPL and the HLA-DRBl risk alleles. This possibility was 
directly examined using case-only logistic regression models 
wherein SE positivity was considered the outcome and NKAPL 
genotype an additive effect. This analysis (table 5) revealed sig- 
nificant interactions between risks conferred by HLA-DRBl 
alleles and the NKAPL rs35656932 and rsl3208096 SNPs, with 
the interaction effect being negative for both the latter SNPs 
(OR=0.67/CI 0.54 to 0.83 and OR=0.69/CI 0.55 to 0.86, 
respectively). By comparison, a positive interaction effect was 
apparent between the RA-associated PTPNZZ rs2476601 variant 
and SE positivity (OR=1.34, CI 1.08 to 1.65). Results of multi- 
nomial logistic regression analyses (see online supplementary 
table S7) further confirmed strong interaction of the HLA-DRBl 
SE alleles with each of the NKAPL (p=4.35xl0"^^ and 
2.51x10-^^) and the PTPN22 (p=2.89xl0-^^) risk alleles and 
again revealed the ORs associated with either of the two 
NKAPL risk alleles to be lower, but for the PTPN22 risk allele 
to be higher, in a comparison of SE positive cases to controls 
versus SE negative cases to controls. 

DISCUSSION 

The NKAPL region emerged as a candidate RA risk locus in the 
context of our prior GWAS data providing strong evidence for 
association of SNPs across this locus with RA in a Canadian 
and US study population. Because the association was sup- 
ported in subsequent meta-analysis incorporating this and four 
more G\AA.S datasets,"^^ we undertook fine mapping and 



Table 5 HLA-DRBl interaction with NKAPL or PTPN22 risk alleles 



SNP 




p Value 


OR (95% CI) 


rs35656932 


13.44 


2.0 xlO-"^ 


0.67 (0.54 to 0.83) 


rsl 3208096 


11.27 


8.0x10-"^ 


0.69 (0.55 to 0.86) 


rs2476601 


7.15 


7.5x10-^ 


1.34 (1.08 to 1.65) 



Logistic regression analysis using case-only design was used to test for interactions 
between HLA-DRBl SE alleles and the NKAPL rs35656932 and rsl 3208096 SNPs or 
the PTPN22 rs2476601 risk variant. 
NKAPL, NF-kB activating protein-like. 



conditional analyses to screen for risk variants with stronger 
and/or independent signals of disease association. By genotyp- 
ing our Canadian RA case/control cohort for 101 SNPs across 
the locus, we have identified six variants for which association 
with RA reaches a conservative level of genome-wide signifi- 
cance (p<5.7xl0~^). Haplotype analysis reveals these six var- 
iants to all lie within one of these haplotype blocks in a region 
encompassing the NKAPL and three zing finger transcription 
factor genes. These genes are in strong LD with one another, 
but results of stepwise and conditional logistic regression ana- 
lyses indicate that both ZNF193 rs35656932 and NKAPL 
rsl3208096 SNPs contribute to risk for RA, associations for 
these two markers remaining significant after conditioning for 
each other associated SNP and results of stepwise logistic 
regression also suggesting independent effects of these SNPs on 
risk. These two associations were also replicated, albeit at 
modest levels of significance, in an independent US-based 
cohort. 

Interpreting effects of the NKAPL locus on RA risk is compli- 
cated by the location of this locus in a chromosomal region 
(6p22.1) upstream of the HLA class II genes. While the NKAPL 
locus maps about 4306 kb away from the HLA-DRBl gene and 
1386 kb upstream from the telomeric end of the HLA region, 
the extensive LD across the region raises the possibility that 
the NKAPL association signals reflect LD with HLA-DRBi SE 
alleles. However, effects of NKAPL as well as DRBi SNPs on 
disease risk were revealed here by the logistic regression ana- 
lyses conditioning on either of two HLA-DRBl* 0401 tag SNPs. 
Analysis of pairwise LD between each of the most strongly 
associated NKAPL SNPs and the HLA-DRBl tag SNPs also pro- 
vided no evidence for LD (r^<0.01) between SNPs at these 
respective loci and a stepwise logistic regression analysis com- 
bining the top six candidate SNPs from the fine-mapping study, 
379 HLA region SNPs from the GWAS and the imputed HLA 
alleles further support contribution of the NKAPL locus to risk 
for RA. These findings are consistent with other data suggest- 
ing that the MHC locus contains loci in addition to HLA-DRBl 
that confer risk to RA.^^ Importantly primary association of 
this locus with SE negative disease also implies interaction 
between the NKAPL and HLA-DRBl risk alleles, a possibility 
supported by the results of case-only and multinomial logistic 
analyses showing very significant negative interaction effects 
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between HLA-DRBl and each of two NKAPL risk alleles. While 
both the NKAPL and the observed PTPNZZ-HLA-DRBl inter- 
action effects on risk require further evaluation in relation to 
their biological significance^ these data provide new insights 
into the complex effector interactions that may link risk geno- 
types to disease pathogenesis in RA. 

The current data identify two SNPs, rsl3208096 and 
rs35656932, as the major drivers of the association signal at the 
NKAPL locus. Among these^ rsl3208096 maps 1787bp upstream 
of NKAPL gene expressed in many tissues^ including most 
immune cell populations.^^ The gene encodes a 402 amino acid 
nuclear protein highly homologous to the NKAP that functions 
as a transcriptional repressor of NOTCH signalling in thymo- 
cytes and is required for haemopoietic stem cell maintenance 
and survival. NKAPL is not functionally characterised^ but 
shares with NKAP a domain critical to NKAP roles in transcrip- 
tional repression^^ and recent data from genome-wide annota- 
tion of transcriptional regulators (ENCONE:UCSC genome 
browser: http://genome.ucsc.edu/) reveal the rsl3208096 SNP to 
be located in a region containing putative transcriptional regula- 
tory histone marks. 

The second SNP that drives the association signal at the 
NKAPL locus, rs35656932, maps 1741bp upstream of the ZNF19S 
gene encoding the zinc finger transcription protein, ZNF193. 
Little is known about the functions of ZNF193 or the ZNF187 
and ZNF307 proteins encoded by the nearby genes. However, 
ZNF307 has transcriptional repressor activity and appears to 
target NF-kB, raising the intriguing, albeit highly speculative pos- 
sibility that several genes at the NKAPL locus influence RA risk 
via effects on NF-kB signalling.^^ 

Thus, while the current data do not identify the disease- 
causal allele at the NKAPL locus, our findings provide compel- 
ling evidence that this locus confers risk for RA and that the 
variants accounting for this association signal emanate from 
highly plausible candidate genes likely to influence NF-kB as 
well as HLA-DRB 1 -moduldited immune cell responses already 
implicated in RA risk and pathogenesis. 
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